Guided Constrained Policy Optimization for Dynamic Quadrupedal Robot Locomotion

نویسندگان

چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Guided Optimization for Balanced Locomotion

Teaching simulated creatures how to walk and run is a challenging problem. As with a baby learning to walk, however, the task of synthesizing the necessary muscle control is simplified if an external hand to assist in maintaining balance is provided. A method of using guiding forces to allow progressive learning of control actions for balanced locomotion is presented. The process has three stag...

متن کامل

Locomotion Gait Optimization for a Quadruped Robot

Legged robot gait generation is a challenging task that involves the control of a large number of degrees of freedom (DOF’s) within a mechanical structure that varies during locomotion. A large number of motion parameters have to be considered in order to obtain a stable, natural and efficient locomotion. Legged robot locomotion applies nonlinear dynamical equations of high order with a multidi...

متن کامل

High speed locomotion for a quadrupedal microrobot

Research over the past several decades has elucidated some of the mechanisms behind high speed, highly efficient and robust locomotion in insects such as cockroaches. Roboticists have used this information to create biologically-inspired machines capable of running, jumping, and climbing robustly over a variety of terrains. To date, little work has been done to develop an at-scale insect-inspir...

متن کامل

Machine Learning for Fast Quadrupedal Locomotion

For a robot, the ability to get from one place to another is one of the most basic skills. However, locomotion on legged robots is a challenging multidimensional control problem. This paper presents a machine learning approach to legged locomotion, with all training done on the physical robots. The main contributions are a specification of our fully automated learning environment and a detailed...

متن کامل

Constrained Policy Optimization

For many applications of reinforcement learning it can be more convenient to specify both a reward function and constraints, rather than trying to design behavior through the reward function. For example, systems that physically interact with or around humans should satisfy safety constraints. Recent advances in policy search algorithms (Mnih et al., 2016; Schulman et al., 2015; Lillicrap et al...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Robotics and Automation Letters

سال: 2020

ISSN: 2377-3766,2377-3774

DOI: 10.1109/lra.2020.2979656